Overview

Dataset Statistics

Number of Variables 14
Number of Rows 603
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 175.7 KB
Average Row Size in Memory 298.3 B
Variable Types
  • Categorical: 3
  • Numerical: 11

Dataset Insights

year is skewed
Tempo is skewed
Loudness is skewed
Liveness is skewed
Acousticness is skewed
Speechiness is skewed
title has a high cardinality: 584 distinct values
artist has a high cardinality: 184 distinct values
Acousticness has 73 (12.11%) zeros
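
The zero-count insight pairs a raw count with a share of rows. As a minimal sketch, the percentage can be reproduced from two figures in this report (73 zeros, 603 rows). The variable names are illustrative and are not part of the profiling tool.

```python
# Figures taken from this report; the variable names are illustrative.
n_rows = 603
acousticness_zeros = 73

# Share of rows whose Acousticness value is exactly zero, in percent.
zero_share = acousticness_zeros / n_rows * 100
print(f"Acousticness zeros: {zero_share:.2f}%")
```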

Variables


title

categorical

Approximate Distinct Count 584
Approximate Unique (%) 96.9%
Missing 0
Missing (%) 0.0%
Memory Size 51297

Length

Mean 18.2919
Standard Deviation 13.9561
Median 15
Minimum 1
Maximum 97

Sample

1st row Hey, Soul Sister
2nd row Love The Way You L...
3rd row TiK ToK
4th row Bad Romance
5th row Just the Way You A...

Letter

Count 8929
Lowercase Letter 6950
Space Separator 1514
Uppercase Letter 1979
Dash Punctuation 54
Decimal Number 29
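
The labels in this block match Unicode general category names, and the reported Count equals Lowercase Letter plus Uppercase Letter (6950 + 1979 = 8929). The sketch below assumes the tool counts characters by Unicode category, which the report does not state, and applies that idea to the three untruncated sample titles only.

```python
import unicodedata
from collections import Counter

# Untruncated sample titles from this report.
titles = ["Hey, Soul Sister", "TiK ToK", "Bad Romance"]

# Map every character to its two-letter Unicode general category.
category_counts = Counter(
    unicodedata.category(ch) for title in titles for ch in title
)

lowercase = category_counts["Ll"]  # Lowercase Letter
uppercase = category_counts["Lu"]  # Uppercase Letter
spaces = category_counts["Zs"]     # Space Separator
dashes = category_counts["Pd"]     # Dash Punctuation
digits = category_counts["Nd"]     # Decimal Number
letters = lowercase + uppercase    # assumed meaning of "Count"

print(letters, lowercase, uppercase, spaces, dashes, digits)
```

The comma in the first title falls into a category that this block of the report does not list, so it is not counted in any of the six figures.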

artist

categorical

Approximate Distinct Count 184
Approximate Unique (%) 30.5%
Missing 0
Missing (%) 0.0%
Memory Size 45543

Length

Mean 10.5274
Standard Deviation 3.9051
Median 11
Minimum 2
Maximum 24

Sample

1st row Train
2nd row Eminem
3rd row Kesha
4th row Lady Gaga
5th row Bruno Mars
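
The Length block summarizes the character length of each value. A minimal sketch of the same summary, computed only on the five sample artists listed here, follows. Whether the report uses the sample or the population standard deviation is not stated; the sample form is an assumption.

```python
import statistics

# The five sample values shown in this report.
artists = ["Train", "Eminem", "Kesha", "Lady Gaga", "Bruno Mars"]
lengths = [len(name) for name in artists]

length_summary = {
    "mean": statistics.mean(lengths),
    "stdev": statistics.stdev(lengths),  # sample standard deviation (assumption)
    "median": statistics.median(lengths),
    "min": min(lengths),
    "max": max(lengths),
}
print(length_summary)
```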

Letter

Count 5805
Lowercase Letter 4658
Space Separator 489
Uppercase Letter 1147
Dash Punctuation 3
Decimal Number 21

Genre

categorical

Approximate Distinct Count 50
Approximate Unique (%) 8.3%
Missing 0
Missing (%) 0.0%
Memory Size 44758
  • The most frequent value (dance pop) occurs over 5.45 times as often as the second most frequent value (pop)

Length

Mean 9.2255
Standard Deviation 3.4069
Median 9
Minimum 3
Maximum 25

Sample

1st row neo mellow
2nd row detroit hip hop
3rd row dance pop
4th row dance pop
5th row pop

Letter

Count 5026
Lowercase Letter 5026
Space Separator 524
Uppercase Letter 0
Dash Punctuation 3
Decimal Number 0
  • The top 2 categories (dance pop, pop) account for over 50.0% of rows
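
The two category insights in this section (the ratio between the two most frequent values, and the combined share of the top two) can be sketched with a frequency count. The full Genre column is not reproduced in this report, so the example uses only the five sample rows; its numbers will not match the reported 5.45 ratio.

```python
from collections import Counter

# The five sample rows from this report, not the full column.
genres = ["neo mellow", "detroit hip hop", "dance pop", "dance pop", "pop"]
counts = Counter(genres)

# Two most frequent categories with their counts.
(top_label, top_count), (second_label, second_count) = counts.most_common(2)

ratio = top_count / second_count
top2_share = 100 * (top_count + second_count) / len(genres)
print(top_label, ratio, top2_share)
```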

year

numerical

Approximate Distinct Count 10
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 2014.592
Minimum 2010
Maximum 2019
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • year is skewed left (γ1 = -0.1855)

Quantile Statistics

Minimum 2010
5-th Percentile 2010
Q1 2013
Median 2015
Q3 2017
95-th Percentile 2018.9
Maximum 2019
Range 9
IQR 4
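
Range and IQR follow directly from the quantiles: range is maximum minus minimum, and IQR is Q3 minus Q1. The sketch below uses a hypothetical column with one row per year, so its quartiles differ from the reported 2013 and 2017. The interpolation method used by the profiling tool is not stated; "inclusive" is an assumption.

```python
import statistics

# Hypothetical data: one row per year, unlike the real column.
years = list(range(2010, 2020))

# Quartiles with linear interpolation between neighboring values.
q1, median, q3 = statistics.quantiles(years, n=4, method="inclusive")

iqr = q3 - q1
value_range = max(years) - min(years)
print(q1, median, q3, iqr, value_range)
```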

Descriptive Statistics

Mean 2014.592
Standard Deviation 2.6071
Variance 6.7967
Sum 1.2148e+06
Skewness -0.1855
Kurtosis -0.9692
Coefficient of Variation 0.001294
  • year is not normally distributed (p-value 2.5379910341298425e-06)
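
The report does not name the normality test behind this p-value. As a rough cross-check under a different, explicitly assumed test, the Jarque-Bera statistic can be computed from the reported row count, skewness, and kurtosis (treated here as excess kurtosis, since the value is negative). Its p-value is not expected to equal the reported one, only to support the same conclusion.

```python
import math

# Figures reported for year in this document.
n = 603
skew = -0.1855
excess_kurtosis = -0.9692

# Jarque-Bera statistic: n/6 * (S^2 + K^2 / 4), with K as excess kurtosis.
jb_statistic = n / 6 * (skew ** 2 + excess_kurtosis ** 2 / 4)

# Under normality the statistic is asymptotically chi-square with 2 degrees
# of freedom, whose survival function is exp(-x / 2).
p_value = math.exp(-jb_statistic / 2)
print(f"JB = {jb_statistic:.2f}, p = {p_value:.3g}")
```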

Tempo

numerical

Approximate Distinct Count 104
Approximate Unique (%) 17.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.5755
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Tempo is skewed right (γ1 = 0.5357)

Quantile Statistics

Minimum 0
5-th Percentile 0.4029
Q1 0.4854
Median 0.5825
Q3 0.6262
95-th Percentile 0.8092
Maximum 1
Range 1
IQR 0.1408

Descriptive Statistics

Mean 0.5755
Standard Deviation 0.1204
Variance 0.01449
Sum 347.0049
Skewness 0.5357
Kurtosis 1.6851
Coefficient of Variation 0.2092
  • Tempo is not normally distributed (p-value 1.0777911099391033e-07)
  • Tempo has 28 outliers
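
The report does not say how outliers are identified. A common convention, assumed here, is Tukey's rule: values more than 1.5 IQR below Q1 or above Q3 are flagged. The sketch applies it to the Tempo quartiles reported above; it cannot reproduce the count of 28 without the raw column.

```python
# Quartiles reported for Tempo in this document.
q1, q3 = 0.4854, 0.6262
iqr = q3 - q1

# Tukey fences (an assumed rule, not confirmed by the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr


def is_outlier(value):
    """Return True when the value lies outside the Tukey fences."""
    return value < lower_fence or value > upper_fence


print(round(lower_fence, 4), round(upper_fence, 4))
```

Under this rule the reported minimum (0) and maximum (1) would both be flagged, while the median (0.5825) would not.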

Energy

numerical

Approximate Distinct Count 77
Approximate Unique (%) 12.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.7194
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Energy is skewed left (γ1 = -0.9827)

Quantile Statistics

Minimum 0
5-th Percentile 0.3888
Q1 0.6224
Median 0.7551
Q3 0.8367
95-th Percentile 0.9388
Maximum 1
Range 1
IQR 0.2143

Descriptive Statistics

Mean 0.7194
Standard Deviation 0.1664
Variance 0.0277
Sum 433.8163
Skewness -0.9827
Kurtosis 1.0294
Coefficient of Variation 0.2313
  • Energy has 11 outliers
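
Several descriptive statistics are linked by simple arithmetic: variance is the square of the standard deviation, and the coefficient of variation is the standard deviation divided by the mean. A quick check with the Energy figures reported above (inputs are rounded, so only the leading digits are expected to agree):

```python
# Figures reported for Energy in this document.
mean = 0.7194
std_dev = 0.1664

variance = std_dev ** 2
coefficient_of_variation = std_dev / mean

print(round(variance, 4), round(coefficient_of_variation, 4))
```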

Danceability

numerical

Approximate Distinct Count 70
Approximate Unique (%) 11.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.6637
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Danceability is skewed left (γ1 = -0.6779)
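
The γ1 value is a skewness coefficient: negative when the longer tail is on the left, positive when it is on the right. A minimal sketch of the moment-based form (third central moment divided by the 1.5 power of the second) follows. Whether the tool applies a bias correction is not stated, and the data below is hypothetical.

```python
def skewness(values):
    """Moment-based skewness: m3 / m2 ** 1.5, without bias correction."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5


# Hypothetical samples: one long right tail, and its mirror image.
right_tailed = [1, 2, 3, 4, 10]
left_tailed = [-v for v in right_tailed]

print(skewness(right_tailed), skewness(left_tailed))
```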

Quantile Statistics

Minimum 0
5-th Percentile 0.3928
Q1 0.5876
Median 0.6804
Q3 0.7526
95-th Percentile 0.8557
Maximum 1
Range 1
IQR 0.1649

Descriptive Statistics

Mean 0.6637
Standard Deviation 0.1379
Variance 0.01902
Sum 400.2165
Skewness -0.6779
Kurtosis 1.0454
Coefficient of Variation 0.2078
  • Danceability has 18 outliers

Loudness

numerical

Approximate Distinct Count 14
Approximate Unique (%) 2.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.9383
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Loudness is skewed left (γ1 = -12.4187)

Quantile Statistics

Minimum 0
5-th Percentile 0.881
Q1 0.9138
Median 0.9483
Q3 0.9655
95-th Percentile 0.9828
Maximum 1
Range 1
IQR 0.05172

Descriptive Statistics

Mean 0.9383
Standard Deviation 0.04824
Variance 0.002327
Sum 565.7931
Skewness -12.4187
Kurtosis 235.8279
Coefficient of Variation 0.05141
  • Loudness is not normally distributed (p-value 5.41806832511503e-12)
  • Loudness has 4 outliers
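
Loudness has a minimum of 0 while its 5th percentile is 0.881, so a handful of very low values may account for much of the extreme skewness and kurtosis. The sketch below illustrates that effect on a hypothetical column. It is not the Loudness data, and the moment formulas without bias correction are an assumption about how such figures are computed.

```python
def standardized_moment(values, order):
    """Central moment of the given order divided by variance ** (order / 2)."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / n
    central = sum((v - mean) ** order for v in values) / n
    return central / variance ** (order / 2)


# Hypothetical column: 99 identical high values and a single zero.
values = [0.95] * 99 + [0.0]

skew = standardized_moment(values, 3)
excess_kurtosis = standardized_moment(values, 4) - 3
print(skew, excess_kurtosis)
```

A single extreme value among 100 is enough here to push the skewness below -9 and the excess kurtosis above 90.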

Liveness

numerical

Approximate Distinct Count 61
Approximate Unique (%) 10.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.2402
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Liveness is skewed right (γ1 = 1.7118)

Quantile Statistics

Minimum 0
5-th Percentile 0.08108
Q1 0.1216
Median 0.1622
Q3 0.3243
95-th Percentile 0.5797
Maximum 1
Range 1
IQR 0.2027

Descriptive Statistics

Mean 0.2402
Standard Deviation 0.1771
Variance 0.03135
Sum 144.8378
Skewness 1.7118
Kurtosis 3.0757
Coefficient of Variation 0.7372
  • Liveness is not normally distributed (p-value 4.98749947069674e-10)
  • Liveness has 23 outliers

Valence

numerical

Approximate Distinct Count 94
Approximate Unique (%) 15.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.5329
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Valence is skewed left (γ1 = -0.0737)

Quantile Statistics

Minimum 0
5-th Percentile 0.1439
Q1 0.3571
Median 0.5306
Q3 0.7041
95-th Percentile 0.898
Maximum 1
Range 1
IQR 0.3469

Descriptive Statistics

Mean 0.5329
Standard Deviation 0.2297
Variance 0.05277
Sum 321.3469
Skewness -0.07372
Kurtosis -0.8251
Coefficient of Variation 0.4311

Duration

numerical

Approximate Distinct Count 144
Approximate Unique (%) 23.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.3127
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Duration is skewed right (γ1 = 1.3362)

Quantile Statistics

Minimum 0
5-th Percentile 0.1624
Q1 0.2345
Median 0.3
Q3 0.3638
95-th Percentile 0.5203
Maximum 1
Range 1
IQR 0.1293

Descriptive Statistics

Mean 0.3127
Standard Deviation 0.1177
Variance 0.01385
Sum 188.5414
Skewness 1.3362
Kurtosis 4.1346
Coefficient of Variation 0.3764
  • Duration is not normally distributed (p-value 0.000238768284176149)
  • Duration has 19 outliers

Acousticness

numerical

Approximate Distinct Count 75
Approximate Unique (%) 12.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.1447
Minimum 0
Maximum 1
Zeros 73
Zeros (%) 12.1%
Negatives 0
Negatives (%) 0.0%
  • Acousticness is skewed right (γ1 = 2.1971)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.0101
Median 0.06061
Q3 0.1717
95-th Percentile 0.6364
Maximum 1
Range 1
IQR 0.1616

Descriptive Statistics

Mean 0.1447
Standard Deviation 0.2098
Variance 0.044
Sum 87.2626
Skewness 2.1971
Kurtosis 4.4286
Coefficient of Variation 1.4495
  • Acousticness is not normally distributed (p-value 5.486861159803676e-16)
  • Acousticness has 61 outliers

Speechiness

numerical

Approximate Distinct Count 39
Approximate Unique (%) 6.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.1741
Minimum 0
Maximum 1
Zeros 1
Zeros (%) 0.2%
Negatives 0
Negatives (%) 0.0%
  • Speechiness is skewed right (γ1 = 2.5343)

Quantile Statistics

Minimum 0
5-th Percentile 0.0625
Q1 0.08333
Median 0.1042
Q3 0.1875
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0.1042

Descriptive Statistics

Mean 0.1741
Standard Deviation 0.1559
Variance 0.0243
Sum 105
Skewness 2.5343
Kurtosis 7.0277
Coefficient of Variation 0.8953
  • Speechiness is not normally distributed (p-value 7.876964072726992e-12)
  • Speechiness has 66 outliers
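
Every audio feature in this report has a minimum of 0 and a maximum of 1, and the Speechiness quantiles are all multiples of 1/48, which is consistent with min-max scaling of integer values. The report does not state that scaling was applied, so the sketch below is an inference, and the raw values in it are hypothetical.

```python
def min_max_scale(values):
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("cannot scale a constant column")
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical raw integer scores spanning 0 to 48.
raw = [0, 3, 4, 5, 9, 24, 48]
scaled = min_max_scale(raw)
print([round(v, 4) for v in scaled])
```

With this hypothetical range, the scaled values 0.0625, 0.0833, 0.1042, 0.1875, and 0.5 line up with the reported 5th percentile, Q1, median, Q3, and 95th percentile.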

Popularity

numerical

Approximate Distinct Count 71
Approximate Unique (%) 11.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9648
Mean 0.6719
Minimum 0
Maximum 1
Zeros 5
Zeros (%) 0.8%
Negatives 0
Negatives (%) 0.0%
  • Popularity is skewed left (γ1 = -1.4241)

Quantile Statistics

Minimum 0
5-th Percentile 0.3848
Q1 0.6061
Median 0.697
Q3 0.7677
95-th Percentile 0.8485
Maximum 1
Range 1
IQR 0.1616

Descriptive Statistics

Mean 0.6719
Standard Deviation 0.1466
Variance 0.0215
Sum 405.1717
Skewness -1.4241
Kurtosis 3.559
Coefficient of Variation 0.2182
  • Popularity is not normally distributed (p-value 0.0007659743816242446)
  • Popularity has 25 outliers

Interactions

Correlations

Missing Values